A Blanket Binarization Method for Character String Extraction

Authors

  • Hiromi Yoshida
  • Naoki Tanaka
Abstract

This paper proposes a binarization method based on fractal dimension for character string extraction. A major problem in extracting characters from a scene image is handling many different types of characters against a complex background. The proposed method obtains multiple threshold values, each corresponding to a character region, by detecting the stable intervals of the fractal dimension (FD). A stable interval is a relatively low and flat valley of the FD; it indicates that the binarized image has stable connected regions and therefore that well-formed character regions have appeared. These character regions may contain noise and may conflict with regions derived from other threshold values. We call them "Candidate Character Region Images" (CCRI) and process them with a two-step noise reduction. The CCRIs are then integrated into a single binarized output image through a contention resolution process. We evaluate the proposed method by comparing it with Niblack's method, a local method, and Otsu's method, a global method, on the dataset provided at ICDAR 2003.
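The stable-interval idea can be illustrated with a minimal sketch. It rests on several assumptions that are not taken from the paper: the abstract does not say how FD is estimated, so a simple box-counting estimator stands in for it; the function names `box_counting_fd`, `fd_curve`, and `stable_intervals`, and the parameters `tol` and `min_len`, are hypothetical; and only flatness is checked, not the "relatively low" criterion. Noise reduction and contention resolution are not covered.

```python
import numpy as np

def box_counting_fd(binary):
    """Box-counting estimate of the fractal dimension of a 2-D boolean mask.

    Assumption: box counting is a stand-in estimator chosen for illustration.
    """
    n = int(np.log2(min(binary.shape)))
    sizes, counts = [], []
    for k in range(n):
        s = 2 ** k
        h = (binary.shape[0] // s) * s
        w = (binary.shape[1] // s) * s
        # Split the (cropped) mask into s x s boxes and count non-empty boxes.
        blocks = binary[:h, :w].reshape(h // s, s, w // s, s)
        c = np.count_nonzero(blocks.any(axis=(1, 3)))
        if c > 0:
            sizes.append(s)
            counts.append(c)
    if len(sizes) < 2:
        return 0.0
    # FD is the slope of log(count) against log(1 / box size).
    slope, _ = np.polyfit(np.log(1.0 / np.array(sizes)), np.log(counts), 1)
    return float(slope)

def fd_curve(gray):
    """FD of the binarized image at each threshold 0..255 (NaN if degenerate)."""
    fd = np.full(256, np.nan)
    for t in range(256):
        binary = gray > t
        # Skip thresholds that give an empty or all-foreground image.
        if binary.any() and not binary.all():
            fd[t] = box_counting_fd(binary)
    return fd

def stable_intervals(fd, tol=0.02, min_len=5):
    """Threshold runs where FD changes by at most `tol` between neighbors.

    `tol` and `min_len` are hypothetical tuning parameters.
    """
    intervals, start = [], None
    for t in range(len(fd)):
        valid = not np.isnan(fd[t])
        if start is None:
            if valid:
                start = t
        elif not valid or abs(fd[t] - fd[t - 1]) > tol:
            if t - start >= min_len:
                intervals.append((start, t - 1))
            start = t if valid else None
    if start is not None and len(fd) - start >= min_len:
        intervals.append((start, len(fd) - 1))
    return intervals
```

On a synthetic image with one bright rectangle on a uniform darker background, `stable_intervals(fd_curve(image))` returns a single interval spanning the thresholds between the two gray levels; any threshold inside that interval yields the same candidate region. Real scene images would need the noise handling that the abstract describes.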

Similar resources

A Fast Algorithm for Korean Text Extraction and Segmentation from Subway Signboard Images Utilizing Smartphone Sensors

We present a fast algorithm for Korean text extraction and segmentation from subway signboards using smart phone sensors in order to minimize computational time and memory usage. The algorithm can be used as preprocessing steps for optical character recognition (OCR): binarization, text location, and segmentation. An image of a signboard captured by smart phone camera while holding smart phone ...

An Effective Edge and Texture Based Approach towards Curved Videotext Detection and Extraction

Video text greatly helps present-day video indexing and retrieval systems, as it often carries significant semantic information. Video text analysis is challenging due to varying backgrounds, multiple orientations, and low contrast between text and non-text regions. The proposed approach explores a new framework for curved video text detection and recognition where from the observation that curve te...

Using Irregular Pyramid for Text Segmentation and Binarization of Gray Scale Image

Compared to binary images that most text extraction methods work on, gray scale images provide much more information for the extraction task. On the other hand, complications also arise in determining the subject textual content from its background region (i.e., thresholding) before the actual text extraction process can begin. Differing from the usual sequence of processes where document images ...

Using Irregular Pyramid for Text Segmentation and Binarization of Gray Scale Images

Compared to binary images that most text extraction methods work on, gray scale images provide much more information for the extraction task. On the other hand, complications also arise in determining the subject textual content from its background region (i.e., thresholding) before the actual text extraction process can begin. Differing from the usual sequence of processes where document images a...

Text Extraction and Text Binarization Algorithms

In the conventional method, grey image binarization with a given threshold is employed to extract high-intensity video character regions. A corner-based approach to detecting text and captions in videos is presented in [47]. This approach is inspired by the observation that corner points are densely and regularly present in characters, especially in text and captions. The usage...

Publication date: 2011